299 research outputs found

    Theano: new features and speed improvements

    Full text link
    Theano is a linear algebra compiler that optimizes a user's symbolically-specified mathematical computations to produce efficient low-level implementations. In this paper, we present new features and efficiency improvements to Theano, and benchmarks demonstrating Theano's performance relative to Torch7, a recently introduced machine learning library, and to RNNLM, a C++ library targeted at recurrent neural networks.Comment: Presented at the Deep Learning Workshop, NIPS 201

    An Integration-Oriented Ontology to Govern Evolution in Big Data Ecosystems

    Full text link
    Big Data architectures allow to flexibly store and process heterogeneous data, from multiple sources, in their original format. The structure of those data, commonly supplied by means of REST APIs, is continuously evolving. Thus data analysts need to adapt their analytical processes after each API release. This gets more challenging when performing an integrated or historical analysis. To cope with such complexity, in this paper, we present the Big Data Integration ontology, the core construct to govern the data integration process under schema evolution by systematically annotating it with information regarding the schema of the sources. We present a query rewriting algorithm that, using the annotated ontology, converts queries posed over the ontology to queries over the sources. To cope with syntactic evolution in the sources, we present an algorithm that semi-automatically adapts the ontology upon new releases. This guarantees ontology-mediated queries to correctly retrieve data from the most recent schema version as well as correctness in historical queries. A functional and performance evaluation on real-world APIs is performed to validate our approach.Comment: Preprint submitted to Information Systems. 35 page

    Linear Sampling Method applied to Non Destructive Testing of an elastic waveguide: theory, numerics and experiments

    Get PDF
    International audienceThis paper presents an application of the Linear Sampling Method to ultrasonic Non Destructive Testing of an elastic waveguide. In particular, the NDT context implies that both the solicitations and the measurements are located on the surface of the waveguide and are given in the time domain. Our strategy consists in using a modal formulation of the Linear Sampling Method at multiple frequencies, such modal formulation being justified theoretically in [1] for rigid obstacles and in [2] for cracks. Our strategy requires the inversion of some emission and reception matrices which deserve some special attention due to potential ill-conditioning. The feasibility of our method is proved with the help of artificial data as well as real data

    Deep Self-Taught Learning for Handwritten Character Recognition

    Full text link
    Recent theoretical and empirical work in statistical machine learning has demonstrated the importance of learning algorithms for deep architectures, i.e., function classes obtained by composing multiple non-linear transformations. Self-taught learning (exploiting unlabeled examples or examples from other distributions) has already been applied to deep learners, but mostly to show the advantage of unlabeled examples. Here we explore the advantage brought by {\em out-of-distribution examples}. For this purpose we developed a powerful generator of stochastic variations and noise processes for character images, including not only affine transformations but also slant, local elastic deformations, changes in thickness, background images, grey level changes, contrast, occlusion, and various types of noise. The out-of-distribution examples are obtained from these highly distorted images or by including examples of object classes different from those in the target test set. We show that {\em deep learners benefit more from out-of-distribution examples than a corresponding shallow learner}, at least in the area of handwritten character recognition. In fact, we show that they beat previously published results and reach human-level performance on both handwritten digit classification and 62-class handwritten character recognition

    Đ“Đ”ĐœĐŽĐ”Ń€ĐœĐžĐč аспДĐșт ĐČĐžĐČŃ‡Đ”ĐœĐœŃ Đ¶Đ°Ń€ĐłĐŸĐœĐœĐŸŃ— лДĐșсОĐșĐž

    Get PDF
    В ŃŃ‚Đ°Ń‚ŃŒĐ” ĐŸĐżŃ€Đ”ĐŽĐ”Đ»ŃĐ”Ń‚ŃŃ ĐżĐŸĐœŃŃ‚ĐžĐ” "ĐŸĐ±Ń‰ĐžĐč Đ¶Đ°Ń€ĐłĐŸĐœ", ĐŸĐ±ĐŸŃĐœĐŸĐČыĐČĐ°Đ”Ń‚ŃŃ Đ°ĐșŃ‚ŃƒĐ°Đ»ŃŒĐœĐŸŃŃ‚ŃŒ ĐłĐ”ĐœĐŽĐ”Ń€ĐœĐŸĐłĐŸ ĐżĐŸĐŽŃ…ĐŸĐŽĐ° Đș ĐžĐ·ŃƒŃ‡Đ”ĐœĐžŃŽ Đ¶Đ°Ń€ĐłĐŸĐœĐžĐ·ĐŒĐŸĐČ. На ĐŒĐ°Ń‚Đ”Ń€ĐžĐ°Đ»Đ” ĐČŃ‹Đ±ĐŸŃ€ĐŸĐș Оз ŃĐ»ĐŸĐČарДĐč, Ń…ŃƒĐŽĐŸĐ¶Đ”ŃŃ‚ĐČĐ”ĐœĐœĐŸĐč Đ»ĐžŃ‚Đ”Ń€Đ°Ń‚ŃƒŃ€Ń‹, ĐżĐŸ Ń€Đ”Đ·ŃƒĐ»ŃŒŃ‚Đ°Ń‚Đ°ĐŒ ĐżŃ€ĐŸĐČĐ”ĐŽĐ”ĐœĐœĐŸĐłĐŸ Đ°ĐČŃ‚ĐŸŃ€ĐŸĐŒ ĐżŃĐžŃ…ĐŸĐ»ĐžĐœĐłĐČОстОчДсĐșĐŸĐłĐŸ эĐșŃĐżĐ”Ń€ĐžĐŒĐ”ĐœŃ‚Đ° ĐŸŃŃƒŃ‰Đ”ŃŃ‚ĐČĐ»ŃĐ”Ń‚ŃŃ ŃĐ”ĐŒĐ°ĐœŃ‚ĐžŃ‡Đ”ŃĐșĐžĐč Đ°ĐœĐ°Đ»ĐžĐ·, ĐŸĐżŃ€Đ”ĐŽĐ”Đ»ŃŃŽŃ‚ŃŃ ŃĐżĐŸŃĐŸĐ±Ń‹ ĐŸĐ±Ń€Đ°Đ·ĐŸĐČĐ°ĐœĐžŃ Đ¶Đ°Ń€ĐłĐŸĐœĐžĐ·ĐŒĐŸĐČ ĐœĐ° ĐŸĐ±ĐŸĐ·ĐœĐ°Ń‡Đ”ĐœĐžĐ” Đ¶Đ”ĐœŃ‰ĐžĐœ ĐČ ŃƒĐșŃ€Đ°ĐžĐœŃĐșĐŸĐŒ ŃĐ·Ń‹ĐșĐ”.ĐŁ статті ĐČĐžĐ·ĐœĐ°Ń‡Đ°Ń”Ń‚ŃŒŃŃ ĐżĐŸĐœŃŃ‚Ń‚Ń "Đ·Đ°ĐłĐ°Đ»ŃŒĐœĐžĐč Đ¶Đ°Ń€ĐłĐŸĐœ", ĐŸĐ±Ò‘Ń€ŃƒĐœŃ‚ĐŸĐČується Đ°ĐșŃ‚ŃƒĐ°Đ»ŃŒĐœĐŸŃŃ‚ŃŒ Ò‘Đ”ĐœĐŽĐ”Ń€ĐœĐŸĐłĐŸ ĐżŃ–ĐŽŃ…ĐŸĐŽŃƒ ĐŽĐŸ ĐČĐžĐČŃ‡Đ”ĐœĐœŃ Đ¶Đ°Ń€ĐłĐŸĐœŃ–Đ·ĐŒŃ–ĐČ. На ĐŒĐ°Ń‚Đ”Ń€Ń–Đ°Đ»Ń– ĐČĐžĐ±Ń–Ń€ĐŸĐș Đ·Ń– ŃĐ»ĐŸĐČĐœĐžĐșіĐČ, Ń…ŃƒĐŽĐŸĐ¶ĐœŃŒĐŸŃ— Đ»Ń–Ń‚Đ”Ń€Đ°Ń‚ŃƒŃ€Đž, Đ·Đ° Ń€Đ”Đ·ŃƒĐ»ŃŒŃ‚Đ°Ń‚Đ°ĐŒĐž ĐżŃ€ĐŸĐČĐ”ĐŽĐ”ĐœĐŸĐłĐŸ Đ°ĐČŃ‚ĐŸŃ€ĐŸĐŒ ĐżŃĐžŃ…ĐŸĐ»Ń–ĐœĐłĐČŃ–ŃŃ‚ĐžŃ‡ĐœĐŸĐłĐŸ Đ”ĐșŃĐżĐ”Ń€ĐžĐŒĐ”ĐœŃ‚Ńƒ Đ·ĐŽŃ–ĐčŃĐœŃŽŃ”Ń‚ŃŒŃŃ ŃĐ”ĐŒĐ°ĐœŃ‚ĐžŃ‡ĐœĐžĐč Đ°ĐœĐ°Đ»Ń–Đ·, ĐČĐžĐ·ĐœĐ°Ń‡Đ°ŃŽŃ‚ŃŒŃŃ ŃĐżĐŸŃĐŸĐ±Đž утĐČĐŸŃ€Đ”ĐœĐœŃ Đ¶Đ°Ń€ĐłĐŸĐœŃ–Đ·ĐŒŃ–ĐČ ĐœĐ° ĐżĐŸĐ·ĐœĐ°Ń‡Đ”ĐœĐœŃ Đ¶Ń–ĐœĐŸĐș ĐČ ŃƒĐșŃ€Đ°Ń—ĐœŃŃŒĐșіĐč ĐŒĐŸĐČі.The author of the article defines the concept "general jargon", bases actuality of gender aspect of jargon words investigation. Using dictionaries, literature and the results of psycho-linguistic experiment, the author analyses semantic and formation ways of jargon words for designation of women in Ukrainian

    Ontogenic Changes in Hematopoietic Hierarchy Determine Pediatric Specificity and Disease Phenotype in Fusion Oncogene-Driven Myeloid Leukemia.

    Get PDF
    Fusion oncogenes are prevalent in several pediatric cancers, yet little is known about the specific associations between age and phenotype. We observed that fusion oncogenes, such as ETO2-GLIS2, are associated with acute megakaryoblastic or other myeloid leukemia subtypes in an age-dependent manner. Analysis of a novel inducible transgenic mouse model showed that ETO2-GLIS2 expression in fetal hematopoietic stem cells induced rapid megakaryoblastic leukemia whereas expression in adult bone marrow hematopoietic stem cells resulted in a shift toward myeloid transformation with a strikingly delayed in vivo leukemogenic potential. Chromatin accessibility and single-cell transcriptome analyses indicate ontogeny-dependent intrinsic and ETO2-GLIS2-induced differences in the activities of key transcription factors, including ERG, SPI1, GATA1, and CEBPA. Importantly, switching off the fusion oncogene restored terminal differentiation of the leukemic blasts. Together, these data show that aggressiveness and phenotypes in pediatric acute myeloid leukemia result from an ontogeny-related differential susceptibility to transformation by fusion oncogenes. SIGNIFICANCE: This work demonstrates that the clinical phenotype of pediatric acute myeloid leukemia is determined by ontogeny-dependent susceptibility for transformation by oncogenic fusion genes. The phenotype is maintained by potentially reversible alteration of key transcription factors, indicating that targeting of the fusions may overcome the differentiation blockage and revert the leukemic state.See related commentary by Cruz Hernandez and Vyas, p. 1653.This article is highlighted in the In This Issue feature, p. 1631

    The GenTree Dendroecological Collection, tree-ring and wood density data from seven tree species across Europe

    Get PDF
    The dataset presented here was collected by the GenTree project (EU-Horizon 2020), which aims to improve the use of forest genetic resources across Europe by better understanding how trees adapt to their local environment. This dataset of individual tree-core characteristics including ring-width series and whole-core wood density was collected for seven ecologically and economically important European tree species: silver birch (Betula pendula), European beech (Fagus sylvatica), Norway spruce (Picea abies), European black poplar (Populus nigra), maritime pine (Pinus pinaster), Scots pine (Pinus sylvestris), and sessile oak (Quercus petraea). Tree-ring width measurements were obtained from 3600 trees in 142 populations and whole-core wood density was measured for 3098 trees in 125 populations. This dataset covers most of the geographical and climatic range occupied by the selected species. The potential use of it will be highly valuable for assessing ecological and evolutionary responses to environmental conditions as well as for model development and parameterization, to predict adaptability under climate change scenarios

    Are we ready to track climate-driven shifts in marine species across international boundaries? - A global survey of scientific bottom trawl data

    Get PDF
    Marine biota are redistributing at a rapid pace in response to climate change and shifting seascapes. While changes in fish populations and community structure threaten the sustainability of fisheries, our capacity to adapt by tracking and projecting marine species remains a challenge due to data discontinuities in biological observations, lack of data availability, and mismatch between data and real species distributions. To assess the extent of this challenge, we review the global status and accessibility of ongoing scientific bottom trawl surveys. In total, we gathered metadata for 283,925 samples from 95 surveys conducted regularly from 2001 to 2019. We identified that 59% of the metadata collected are not publicly available, highlighting that the availability of data is the most important challenge to assess species redistributions under global climate change. Given that the primary purpose of surveys is to provide independent data to inform stock assessment of commercially important populations, we further highlight that single surveys do not cover the full range of the main commercial demersal fish species. An average of 18 surveys is needed to cover at least 50% of species ranges, demonstrating the importance of combining multiple surveys to evaluate species range shifts. We assess the potential for combining surveys to track transboundary species redistributions and show that differences in sampling schemes and inconsistency in sampling can be overcome with spatio-temporal modeling to follow species density redistributions. In light of our global assessment, we establish a framework for improving the management and conservation of transboundary and migrating marine demersal species. We provide directions to improve data availability and encourage countries to share survey data, to assess species vulnerabilities, and to support management adaptation in a time of climate-driven ocean changes.En prensa6,86

    TRY plant trait database – enhanced coverage and open access

    Get PDF
    Plant traits - the morphological, anatomical, physiological, biochemical and phenological characteristics of plants - determine how plants respond to environmental factors, affect other trophic levels, and influence ecosystem properties and their benefits and detriments to people. Plant trait data thus represent the basis for a vast area of research spanning from evolutionary biology, community and functional ecology, to biodiversity conservation, ecosystem and landscape management, restoration, biogeography and earth system modelling. Since its foundation in 2007, the TRY database of plant traits has grown continuously. It now provides unprecedented data coverage under an open access data policy and is the main plant trait database used by the research community worldwide. Increasingly, the TRY database also supports new frontiers of trait‐based plant research, including the identification of data gaps and the subsequent mobilization or measurement of new data. To support this development, in this article we evaluate the extent of the trait data compiled in TRY and analyse emerging patterns of data coverage and representativeness. Best species coverage is achieved for categorical traits - almost complete coverage for ‘plant growth form’. However, most traits relevant for ecology and vegetation modelling are characterized by continuous intraspecific variation and trait–environmental relationships. These traits have to be measured on individual plants in their respective environment. Despite unprecedented data coverage, we observe a humbling lack of completeness and representativeness of these continuous traits in many aspects. We, therefore, conclude that reducing data gaps and biases in the TRY database remains a key challenge and requires a coordinated approach to data mobilization and trait measurements. This can only be achieved in collaboration with other initiatives
    • 

    corecore